模式识别与人工智能
Saturday, March 15, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence  2024, Vol. 37 Issue (5): 398-409    DOI: 10.16451/j.cnki.issn1003-6059.202405002
Object Detection, Recognition and Adversarial Defense Current Issue| Next Issue| Archive| Adv Search |
Multi-consistency Constrained Semi-supervised Video Action Detection Based on Feature Enhancement and Residual Reshaping
HU Zhengping1,2, ZHANG Qiming1, WANG Yulu1, ZHANG Hehao1, DI Jirui1
1. School of Information Science and Engineering, Yanshan University, Qinhuangdao 066004;
2. Hebei Key Laboratory of Information Transmission and Signal Processing, Yanshan University, Qinhuangdao 066004

Download: PDF (1239 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  The feature representations of both original data and augmented data in the consistency regularized semi-supervised video action detection method tend to induce discriminative domain bias between two types of data, thereby resulting in inadequate fitting of the discriminative results. To address this issue, a multi-consistency constrained semi-supervised video action detection method based on feature enhancement and residual reshaping is proposed in this paper. Firstly, the basic action feature descriptors are continuously enhanced and encoded in the spatiotemporal dimension to obtain crucial contextual information for video action understanding. Subsequently, a residual feature reshaping module is employed to obtain multi-scale residual information while reshaping the features. To reduce the discriminative bias between different types of data, multiple consistency constraints are applied to the original data and the augmented data from the perspectives of classification features and action localization features, achieving a match between discriminative results and feature representation of the augmented data and the original data. Experimental results on JHMDB-21 and UCF101-24 datasets demonstrate the effectiveness of the proposed method in improving video action detection accuracy under the condition of limited labeled samples and strong competitiveness.
Key wordsSemi-supervised Learning      Video Action Detection      Feature Enhancement      Multiple Consistency Constraints     
Received: 03 April 2024     
ZTFLH: TP391.41  
Fund:National Natural Science Foundation of China(No.61771420), Young Scientist Fund in National Natural Science Foundation of China(No.62001413)
Corresponding Authors: HU Zhengping, Ph.D., professor. His research interests include pattern recognition and video processing.   
About author:: ZHANG Qiming, Master student. His research interests include semi-supervised video action detection. WANG Yulu, Master student. Her research interests include skeleton-based human action recognition. ZHANG Hehao, Ph.D. candidate. His research interests include 3D human pose estimation. DI Jirui, Ph.D. candidate. His research interests include fine-grained action recognition.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
HU Zhengping
ZHANG Qiming
WANG Yulu
ZHANG Hehao
DI Jirui
Cite this article:   
HU Zhengping,ZHANG Qiming,WANG Yulu等. Multi-consistency Constrained Semi-supervised Video Action Detection Based on Feature Enhancement and Residual Reshaping[J]. Pattern Recognition and Artificial Intelligence, 2024, 37(5): 398-409.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202405002      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2024/V37/I5/398
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn